Picture for Tong Wang

Tong Wang

Jeffrey

Hierarchically Decoupled Mixture-of-Experts for Robust Traffic Sign Recognition in Complex Driving Scenarios

Add code
Jun 01, 2026
Viaarxiv icon

StepAudio 2.5 Technical Report

Add code
May 22, 2026
Viaarxiv icon

Interpretable Discriminative Text Representations via Agreement and Label Disentanglement

Add code
May 20, 2026
Viaarxiv icon

MiVE: Multiscale Vision-language features for reference-guided video Editing

Add code
May 14, 2026
Viaarxiv icon

UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs

Add code
May 12, 2026
Viaarxiv icon

ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

Add code
May 06, 2026
Viaarxiv icon

See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection

Add code
Apr 27, 2026
Viaarxiv icon

DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering

Add code
Mar 10, 2026
Viaarxiv icon

CMSA-Net: Causal Multi-scale Aggregation with Adaptive Multi-source Reference for Video Polyp Segmentation

Add code
Feb 26, 2026
Viaarxiv icon

Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 03, 2026
Viaarxiv icon